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ABSTRACT 

The  lower  bound  on  the  specified  P*  for  the  Gupta  procedure 
for  selecting  a  subset  containing  the  best  of  k  populations,  and 
for  the  Gupta-Sobel  procedure  for  selecting  a  subset  containing 
all  populations  at  least  as  good  as  a  control  population  is  studied 
via  impartial  "no  data"  minimax  decision  rules.  Gibbons,  Olkin, 
and  Sobel  (1977)  state  that  a  theoretical  lower  bound  is  1/2 
for  both  problems.  Our  analysis  shows  (a)  that  1/k  is  the  cor- 
rect  lower  bound  for  the  first  problem,  and  (b)  that  1/2  is  the 
correct  lower  bound  for  the  second  problem  provided  that  a  partic¬ 
ular  loss  function  is  adopted.  Other  (reasonable)  choices  of  loss 
function  lead  to  different  lower  bounds  for  the  second  problem. 


1.  SUBSET  SELECTION  OF  THE  BEST  POPULATION 

Gupta’s  subset  selection  procedure  (Gupta,  1956,  1965) 
selects  a  non-empty  (small)  subset  from  k  j>  2  populations  such 
that  the  probability  is  at  least  equal  to  a  specified  value  P* 
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that  the  "best1*  population  is  contained  in  the  selected  subset. 

His  1965  paper  mentions  (p.  230)  that  "...for  values  of  P*  <  1/k 

there  always  exists  a  no  data  decision  rule."  In  their  recent  text 

Gibbons,  Olkin  and  Sobel  (1977)  (GOS)  state  that  the  theoretical 

k 

lower  bound  for  P*  is  1/2  .  This  note  gives  arguments  showing 

that  Gupta* s  value  of  1/k  is  the  correct  lower  bound. 

For  simplicity,  the  computations  that  follow  assume  that  if 

0^  is  the  parameter  characterizing  the  " goodness"  of  the  ith 

population  (1  <  i  <  k),  then  the  values  of  the  elements  of  £  = 

(^1*^2* *'* *^k)  6  ®  are  distinct  so  that  there  is  a  unique  best 

population.  A  correct  selection  is  made  only  when  the  selected 

subset  contains  the  best  population. 

In  the  "no  data"  situation  a  selection  rule  is  described  in 

terms  of  a  sampling  scheme  {p^|l  £  i  £  k*  1  <  j  <  (.)}  for 

choosing  a  non-empty  subset.  Here  p..  denotes  the  probability 

k 

of  selecting  the  jth_  subset  of  size  i  where  the  (^)  subsets 
are  written  in  some  fixed  order.  We  assume  that  the  statistician 
will  use  an  impartial  (i,e.,  invariant  wrt  the  group  of  permuta¬ 
tions)  rule  (Bahadur  and  Goodman,  1952  and  Eaton,  1967);  it  is 
easy  to  check  that  for  the  case  of  zero-one  loss  this  is  equiva¬ 
lent  to  requiring  that  p . ,  =  p . .  for  1  <  i  <  k  and 

1  <,  , j 2  £  (*)  <**>  P£j  =  Pj/(*)  for  1  £  *  £  1  £  3  £  (*) 

where  p  =  (p,  ,p„,. . .  ,p,  )  satisfies  p.  >0(l<i<k)  and 
y  ^  1  ^  K  1  ~  *=* 

£  p.  =  1.  Hence,  impartial  decision  rules  operate  in  two  inde- 
i=l  1 

pendent  stages:  First  a  subset  of  size  i  is  chosen  according  to 

k 

and  then  one  of  the  (^)  subsets  of  size  i  is  chosen  at 

random.  Both  the  expected  subset  size,  E.tslp},  and  the  proba- 

£  ^ 

bility  of  a  correct  selection,  P  {CS|^},  are  independent  of  jg 

for  invariant  no-data  rules  p.  We  obtain  P{CS|p}  =  inf  P.{CS|p} 
k  *  k  *  «  K  * 

-  T  p.(i/k),  and  E{s|p}  =  sup  E.{S|p}  =  £  ip.  =  kP{CS|p}. 

i=i  1  *  n  *  *  i=i  1  K 

Any  choice  of  P*  «  [1/k,  1]  can  be  attained  exactly  by  an  appro¬ 
priate  rule.  Table  I  lists  the  values  of  P{CS|jd}  and  E{S|^} 
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TABLE  I 


Some  Possible  Choices  of  jd,  and  Associated 
Performance  Characteristics 


sup  Ea{S|p} 

n  £  * 

Pk s  1 

1 

k 

p^  =  1/k  (1  £  i  ^k) 

(k+l)/2k 

(k+l)/2 

k2k_1/(2k-l) 

Pi  =  1 

1/k 

1 

for  several  jd. 

Suppose  that  E.{s|$}  is  regarded  as  the  risk  associated 
with  an  arbitrary  rule  6.  The  rule  p^  -  1  is  clearly  the  uni¬ 
formly  minimum  risk  procedure  in  the  class  of  no-data  rules;  the 
value  of  P(CS|jd}  associated  with  p^  =  1  is  1/k.  In  data 
problems  the  P*  condition,  inf  Pn(CS|6}  >  P*,  specifies  a 
class  of  rules,  C(P*),  to  be  studied.  The  rule  p^  =  1  is  in 
C(P*)  for  all  P*  4  1/k  and  hence  it  is  a  uniformly  minimum  risk 
rule.  P*  must  be  chosen  greater  than  1/k  for  the  problem  to 
require  a  data-dependent  solution  under  the  risk  Ett{s|6). 

Remark  1.1.  One  could  argue  that  the  risk  E„{S|<5}  is  deficient 
and  that  it  would  be  better  to  drop  the  P*  condition  and  adopt 
a  loss  function  which  takes  into  account  both  S  and  the  event  of 
correct  selection.  Goel  and  Rubin  (1977)  and  Chernoff  and  Yahav 
(1977)  give  some  examples  of  this  latter  approach.  Our  basic 
analysis  is  closer  in  spirit  to  that  of  classical  statistical 
theory  in  which  an  optimal  procedure  is  selected  from  a  subclass 
of  all  procedures  defined  by  some  condition  such  as  the  size  of  a 
test  or  the  unbiasedness  of  an  estimator. 

Remark  1.2.  The  statement  on  p.  297  of  GOS  to  the  effect  that 

k 

"...the  probability  1/2  can  be  attained  by  simply  tossing  a 
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fair  coin  for  each  population  to  determine  whether  to  include  it 

in  the  subset  or  not"  is  incorrect.  This  rule,  which  permits  the 

choice  of  an  empty  subset  (an  option  which  is  not  allowed  for  this 

problem),  yields  Pa(CS>  -  1/2. 

£ 

Remark  1.3.  GOS  state  (p.  297)  that  P*  should  be  chosen  greater 
than  (1/2  +  l/2k)  for  applied  work.  Practitioners  who  follow 
the  advice  will  automatically  satisfy  the  correct  theoretical 
bound  since  1/2  +  l/2k  >  1/k  for  k  :>  2. 

2.  SUBSET  SELECTION  WITH  RESPECT  TO  A  CONTROL 

The  Gupta-Sobel  subset  selection  procedure  (Gupta  and  Sobel, 
1958)  is  used  for  selecting  a  subset  (possibly  empty)  of  k  (k^l) 
populations  containing  all  those  populations  at  least  as  good  as 
a  control.  In  the  sequel  below  we  call  such  populations  "optimal." 
A  correct  selection  is  said  to  occur  if  and  only  if  the  above 
stated  goal  is  achieved.  GOS  states  (pp.  307,310)  that  the  lower 
bound  for  P*  is  1/2^. 

Our  analysis  of  the  present  problem  will  differ  from  that  of 

the  previous  one.  The  reason  is  that  while  S  must  clearly  be 

minimized  in  Section  1,  its  role  in  the  present  problem  is  more 

k 

complicated.  If  we  write  the  parameter  space  ft  as  u  ft.  where 

j=0  3 

ftj  (0  <i  j  <,  k)  represents  those  points  for  which  exactly  j 
populations  are  at  least  as  good  as  the  control,  then  when  £  €  ft^ 
we  would  hope  that  the  selected  subset  contains  all  j  optimal 
populations  and  no  others  (in  which  case  S  =  j).  In  particular, 
when  £  €  ftQ  we  would  hope  S  =  0.  Many  loss  functions  can  be 
proposed  which  embody  the  above  notion;  below  we  show  that  l/2k 
is  the  lower  bound  of  the  infimum  of  the  probability  of  correct 
selection  corresponding  to  a  minimax  rule  for  one  of  these  loss 
functions.  However,  other  reasonable  loss  functions  yield  dif¬ 
ferent  lower  bounds. 

We  now  introduce  three  possible  such  loss  functions.  For 
each  £  €  ft  let  B(£)  *  {i | £  0Q)  denote  the  set  of  optimal 
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populations  under  also  let  R  denote  the  set  of  selected 

populations.  Define  L^,  and  as  follows: 


V*‘R) 


if  R  =  B(£) 
otherwise , 


l2c^,R) 


f  0  if  R  =  B(£)  and  £  e  ^  ££  R  3  B(£)  and  £  € 
v  1  otherwise. 


L3($’R)  =  lR-B(^>l- 

Here  denotes  set  difference  and  |a(  denotes  the  cardinality 

of  the  set  A.  penalizes  the  statistician  unless  S  contains 

exactly  the  set  of  optimal  populations.  Lg  is  a  milder  form  of 
which  dichotomizes  the  problem  into  £  e  and  0  e 
for  £  €  flg  it  penalizes  the  statistician  unless  the  empty  set 
is  chosen  while  for  £  e  it  penalizes  him/her  when  the 

selected  subset  does  not  contain  all  optimal  poulations  (but  S 


can  contain  non-optimal  populations  as  well).  Finally,  is 

the  number  of  populations  in  the  selected  subset  which  are  worse 
than  the  control.  Alternatively,  one  might  even  adopt  a  loss  func¬ 
tion  which  is,  e.g. ,  a  linear  combination  of  or  and  L3- 

We  next  introduce  no  data  rules  for  our  present  problem;  these 


are  of  the  form  £  =  (p0,p^,.. . ,pk> 
(0  <  i  <  k)  is  chosen  according  to 


where  a  subset  of  size  i 

k 

jd,  and  then  one  of  the  (.) 


subsets  of  size  i  is  chosen  at  random.  In  contrast  to  the  situ¬ 


ation  in  Section  1,  P^{CS|jd}  now  depends  on  £  €  ft  but  is  a 
constant  over  ^  e  Q.  (0  <,  j  <  k).  A  straightforward  computation 

gives  P{CS|p}  =  inf  P  { CS  j p }  =  min  ?  p.(^)/(^)  =  p.  .  For 

*  a  i  *  i<i<k  j=i  1  >x  3  k 

each  loss  (1  _<  i  3),  Table  II  lists  (1)  the  maximum  of  the 

risk  5  E^{L^(j^,S)  |j^}  for  an  arbitrary  no-data  (2)  the 

minimax  rule,  p^;  (3)  the  infimum  of  the  probability  of  correct 
selection  under  £  ,  P{CS|]D  };  and  (4)  the  minimax  risk. 


It  can  be  shown  that  the  maximum  risk  under  (L^), 
sup  ^(£,6),  (sup  R2(£,6))  is  at  least  1  -  1/2*  (1/2)  for  any 
data  dependent  rule,  6.  Hence  the  statistician  should  not  use  a 
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TABLE  II 

Minimax  Rules  and  Associated  Risk  for  (i  =  1,2,3) 


sup  R.(0,$> 

7* 

mm 

sup  R.(0,p^^) 

a  1  *  * 

Pi 

1  -  min  -r~- 
0<j<Jc  (j) 

piX)  =  "ic“  (05-iik) 

3  2* 

l/2k 

1  -  l/2k 

max{l-p0,  l-pk> 

*  va  .  p<’> 

1/2 

1/2 

k 

l  :Pn- 

j=i  : 

r  •  1 

0 

0 

data  dependent  rule  if  P*  is  chosen  less  than  or  equal  to  1/2 
(1/2).  Similarly  the  analysis  of  Lg  shows  that  P*  must  be 
positive  or  else  the  no-data  rule  should  be  used. 
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